KMID : 0387320200300010015
|
|
Korean Journal of Health Policy and Administration 2020 Volume.30 No. 1 p.15 ~ p.25
|
|
A Study on the Application of Natural Language Processing in Health Care Big Data: Focusing on Word Embedding Methods
|
|
Kim Han-Sang
Chung Yeo-Jin
|
|
Abstract
|
|
|
While healthcare data sets include extensive information about patients, many researchers have limitations in analyzing them due to their intrinsic characteristics such as heterogeneity, longitudinal irregularity, and noise. In particular, since the majority of medical history information is recorded in text codes, the use of such information has been limited due to the high dimensionality of explanatory variables. To address this problem, recent studies applied word embedding techniques, originally developed for natural language processing, and derived positive results in terms of dimensional reduction and accuracy of the prediction model. This paper reviews the deep learning-based natural language processing techniques (word embedding) and summarizes research cases that have used those techniques in the health care field. Then we finally propose a research framework for applying deep learning-based natural language process in the analysis of domestic health insurance data.
|
|
KEYWORD
|
|
Health care big data, High dimensionality, Deep learning, Natural language processing, Word embedding, Word2vec
|
|
FullTexts / Linksout information
|
|
|
|
Listed journal information
|
|
|